Schema mapping generation in the wild

Abstract

Schema mappings enable declarative and executable specification of transformations between different schematic representations of application concepts. Most work on mapping generation has assumed that the source and target schemas are well defined, e.g., with declared keys and foreign keys, and that the mapping generation process exists to support the data engineer in the labour-intensive process of producing a high-quality integration. However, organizations increasingly have access to numerous independently produced datasets, e.g., in a data lake, with a requirement to produce rapid, best-effort integrations, without extensive manual effort. As a result, there is a need to generate mappings in settings without declared relationships, and thus on the basis of inferred profiling data, and over large numbers of sources. Our contributions include a dynamic programming algorithm for exploring the space of potential mappings, and techniques for propagating profiling data through mappings, so that the fitness of candidate mappings can be estimated. The paper also describes how the resulting mappings can be used to populate single and multi-relation target schemas. Experimental results show the effectiveness and scalability of the approach in a variety of synthetic and real-world scenarios.

Similar resources

The Inverse of a Schema Mapping

The inversion of schema mappings has been identified as one of the fundamental operators for the development of a general framework for data exchange, data integration, and more generally, for metadata management. Given a mapping M from a schema S to a schema T, an inverse of M is a new mapping that describes the reverse relationship from T to S, and that is semantically consistent with the relat...

Designing a Knowledge-based Schema Matching System for Schema Mapping

Schema mapping, which provides users with a unified view, is necessary to manage schema heterogeneity among different data sources. Schema matching is a required task for schema mapping that finds semantic correspondences between entity pairs of schemas. Semi-automatic schema matching systems were developed to reduce the manual work involved in schema mapping. However, such approaches require a high manu...

Λ Φ I A Abstraction Mapping Implementation Mapping Q Q ’ Logical Schema Physical Schema

We present an optimization method and algorithm designed for three objectives: physical data independence, semantic optimization, and generalized tableau minimization. The method relies on generalized forms of chase and "backchase" with constraints (dependencies). By using dictionaries (finite functions) in physical schemas we can capture with constraints useful access structures such as indexes...

Distributed Key Generation in the Wild

Distributed key generation (DKG) has been studied extensively in the cryptographic literature. However, it has never been examined outside of the synchronous setting, and the known DKG protocols cannot guarantee safety or liveness over the Internet. In this work, we present the first realistic DKG protocol for use over the Internet. We propose a practical system model for the Internet and defin...

Journal

Journal title: Information Systems

Year: 2022

ISSN: 0306-4379, 1873-6076

DOI: https://doi.org/10.1016/j.is.2021.101904